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Abstract 

We propose a model for deterministic distributed function computation by a network of identical and anonymous 
nodes, with bounded computation and storage capabilities that do not scale with the network size. Our goal is 
to characterize the class of functions that can be computed within this model. In our main result, we exhibit a 
class of non-computable functions, and prove that every function outside this class can at least be approximated. 
The problem of computing averages in a distributed manner plays a central role in our development. 



1 Introduction 

■ The goal of many multi-agent systems, distributed computation algorithms and decentralized data fusion methods 
C/2 \ is to have a set of nodes compute a common value based on initial values or observations at each node. Towards this 

, ^/ purpose, the nodes, which we will sometimes refer to as agents, perform some internal computations and repeatedly 
\ communicate with each other. Let us consider some examples. 

. (a) Quantized consensus: Suppose that each agent begins with an integer value Xi(0)& {0, . . . ,K}. We would 

■ like the agents to end up, at some later time, with values yi that are almost equal, i.e., |y, — yj\ < 1, for all i, j, 
while preserving the sum of the values, i.e., Yh=i x i(0) = Yh=i Hi- This is the so-called quantized averaging problem 

0^ ■ which has received considerable attention recently; see P__l IH1 131 HO] ■ It may be viewed as the problem of computing 
the function (1/n) __]™_i x ii rounded to the nearest integer. 

(b) Distributed hypothesis testing: Consider n sensors interested in deciding between two hypotheses, H and 
H\. Each sensor collects measurements and makes a preliminary decision x^, 6 {0, 1} in favor of one of the hypotheses. 
The sensors would like to make a final decision by majority vote, in which case they need to compute the indicator 
function of the event X^™=i x i — m a distributed way. Alternatively, in a weighted majority vote, they may be 
interested in computing the indicator function of the event X_i=i x * — 3n/4. 

(c) Solitude verification: This is the problem of verifying that at most one node in the network has a given state. 
This problem is of interest if we want to avoid simultaneous transmissions over a common channel [13] . or if we 
want to maintain a single leader (as in motion coordination — see for example [15] ) Given K possible states, so that 
Xi £ {1, . . . , K}, solitude verification is equivalent to the problem of computing the binary function which is equal 
to 1 if and only if \{i : Xi = 1}| = 1. 

There are numerous methods that have been proposed for solving problems such as the above. (See for example the 
vast and growing literature on consensus and averaging methods.) Oftentimes, different algorithms involve different 
computational capabilities on the part of the agents, which makes it hard to talk about "the best" algorithm. At 
the same time, simple algorithms (such as setting up a spanning tree and aggregate information by progressive 
summations over the tree) are often "disqualified" because they require too much coordination or global information. 
One then realizes that a sound discussion of such issues requires the specification of a precise model of computation, 
followed by a systematic analysis of fundamental limitations under any given model. This is precisely the objective 
of this paper: to propose a particular model, and to characterize the class of computable functions under this model. 

Our model provides an abstraction for the most common requirements for distributed algorithms in the sensor 
network literature. It is somewhat special because (i) it does not allow for randomization; (ii) it does not address 
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the case of time- varying interconnection graphs; such extensions are left for future research. Qualitatively speaking, 
our model includes the following features. 

Identical agents: Any two agents with the same number of neighbors must run the same algorithm. 

Anonymity: An agent can distinguish its neighbors using its own, private, local identifiers. However, agents do not 
have global identifiers. 

Absence of global information: Agents have no global information, and do not even have an upper bound on 
the total number of nodes. Accordingly, the algorithm that each agent is running is independent of the network size 
and topology. 

Convergence: Agents hold an estimated output, and this estimate must converge to a desired value which is 
generally a function of all agents' initial observations or values. In particular, for the case of discrete outputs, all 
agents must eventually settle on the desired value. On the other hand, the agents do not need to be aware of such 
termination, which is anyway impossible in the absence of any global information [6]. 

1.1 Goal and Contribution 

We provide in this paper a general model of decentralized anonymous computation with the above described features, 
and characterize the type of functions of the initial values that can be computed. To keep our model simple, we 
only consider deterministic and synchronized agents exchanging messages on a fixed bi-directional network, with no 
time-delays or unreliable transmissions. Agents are modelled as finite automata, so that their individual capabilities 
remain bounded as the number of agents increases. 

We prove that if a function is computable under our model, then its value only depends on the frequencies of the 
different possible initial values. For example, if the initial values Xi only take values and 1, a computable function 
necessarily only depends on po : — |{* : x % — 0}|A* an d Pi '■= \{i '■ Xi = 1}|/"-- In particular, determining the number 
of nodes, or whether at least two nodes have an initial value of 1 is impossible. 

Conversely, we prove that if a function only depends on the frequencies of the different possible initial values 
(and is measurable), then the function can at least be approximated with any given precision, except possibly on 
a set of frequency vectors of arbitrarily small volume. Moreover, if the dependence on these frequencies can be 
expressed by a combination of linear inequalities with rational coefficients, then the function is computable exactly. 
In particular, the functions involved in the quantized consensus and distributed hypothesis testing examples are 
computable, whereas the function involved in solitude verification is not. Similarly, statistical measures such as the 
standard deviation and the kurtosis can be approximated with arbitrary precision. 

Finally, we show that with infinite memory, the frequencies of the different values (i.e., p$,pi in the binary case) 
are computable. 

1.2 Overview of previous work 

There is a large literature on distributed function computation in related models of computation. A common model 
in the distributed computing literature involves the requirement that all processes terminate when the desired output 
is produced. A consequence of the termination requirement is that nodes typically need to know the network size n 
(or an upper bound on n) to compute any non-constant functions. We refer the reader to [TJ O [3H [IZ1 for some 
fundamental results in this setting, and to |10| for an excellent summary of known results. 

Similarly, the biologically-inspired "population algorithm" model of distributed computation has some features 
in common with our model, namely finite-size agents and lack of a termination condition; see [2] for a very nice 
summary of known results. However, these models involve a somewhat different type of agent interactions from the 
ones we consider. 

We note that the impossibility of computing p\ without any memory was shown in |21j . Some experimental 
memorylcss algorithms were proposed in the physics literature [12] . Randomized algorithms for computing particular 
functions were investigated in [161 [7] . We also point the reader to the literature on "quantized averaging," which 
often tends to involve similar themes [9] [20] [3j [8] . 

Several papers quantified the performance of simple heuristics for computing specific functions, typically in 
randomized settings. We refer the reader to [14] and |29j . which studied simple heuristics for computing the majority 
function. A deterministic algorithm for computing the majority function (and some more generalized functions) was 
proposed in [25] . 

Semi-centralized versions of the problem, in which the nodes ultimately transmit to a fusion center, have often 
be considered in the literature, e.g., for distributed statistical inference [25] or detection [19]. The papers [11], [18], 
and |22j consider the complexity of computing a function and communicating its value to a sink node. We refer the 
reader to the references therein for an overview of existing results in such semi-centralized settings. 



Our results differ from previous works in several key respects: (i) Our model, which involves totally decentralized 
computation, deterministic algorithms, and constraints on memory and computation resources at the nodes, but does 
not require the nodes to know when the computation is over, does not seem to have been studied before, (ii) Our 
focus is on identifying computable and non-computable functions under our model, and we achieve a nearly tight 
separation. 

2 Formal description of the model 

The system consists of (i) a communication graph G — (V, E), which is bidirectional (i.e., if £ E, then (j, i) £ E); 
(ii) a port labeling whereby edges outgoing from node i are labeled by port numbers in the set {1, 2, . . . , dcgree(i)}; 
(hi) a family of finite automata (Ad)d=i,2, 3,... ■ (The automaton Ad is meant to describe the behavior of a node with 
degree d.) 

The state of the automaton Ad is a tuple (x, z, y, mi, . . . , md); we will call x £ X= {0, 1, . . . , K} the initial value, 
z £ Zd the internal memory state, y £ Y the output or estimated answer, and mi,...,m<j £ M the messages. 
The sets X, Y, Zd,M are assumed finite, unless there is a statement to the contrary. Furthermore, we assume that 
the number of bits that can be stored at a node is proportional to the node's degree; that is, logl-Z^ < Cd, for 
some absolute constant C. (Clearly, at least this much memory would be needed to be able to store the messages 
received at the previous time step.) We will also assume that is an element of the above defined sets Y, Zd, 
and M. The transition law Ad maps X x Zd x Y x M d to X x Zd x Y x M d : [x, z, y; (mi, . . . , mj] is mapped to 
[x, z' , y'; (m'i, . . . , m' d )] . In words, the automaton creates a new memory state, output, and (outgoing) messages at 
each iteration, but does not change the initial value. 

We construct a dynamical system out of the above elements as follows. Let d(i) be the degree of node i. 
Node i begins with an initial value Xj £ X; it implements the automaton A^i), initialized with x = Xi, and with 
z = y = mi = ... = m d = 0- We use Si(t) = [xi,yi(t), Zi(t),mi t i(t), . . . ,Tnid(i)(t)] to denote the state of the 
automaton implemented by agent i at round t. Let ji,. . . ,jdU) be an enumeration of the neighbors of i, and let pk 
be the port number of the link (jk, i). The evolution of the system is then described by 

[xi,Zi(t+l),yi(t + l);m t j(t + 1), . . . ,™ MW (t + 1)] = A d[i) [x,Zi(t),yi(t);mj UPl (t), . . .,m jd!Pi (t)) . 

In words, the messages "sent" by the neighbors of i into ports leading to i are used to transition to a new state and 
create new messages that i "sends" to its neighbors at the next round. We say that y* £ Y is the final output of this 
dynamical system if there is a time t' such that j/j(t) = y* for every i and t > t' . 

Consider now a family of functions (f n )neM '■ X n —> Y . We say that such a family is computable if there exists 
a family of automata (Ad)d=i,2.... such that for any n, for any connected graph G = (V, E) with n nodes, any port 
labelling, and any set of initial conditions xi, . . . , x n , the final output of the above system is always f n {xi, ■ ■ ■ , x n ). 

In some results, we will also refer to function families (f n )nej\f computable with infinite memory, by which we 
mean that the internal memory sets Zd and output set Y are countable, the rest of the model being unaffected. 

We study in the sequel the general function computation problem: What families of functions are computable, 
and how can we design the automata Ad to compute them? 

3 Necessary condition for computability 

Let us first state the following lemma which can easily be proved by induction on time. 

Lemma 3.1. Suppose that G = ({1, . . . , n}, E) and G' = ({1, . . . , n}, E') are isomorphic, that is, there exists a 
permutation 7r such that £ E if and only if 7r(j)) € E' . Further, suppose that the port label at node i for 

the edge leading to j in G is the same as the port label at node for the edge leading to 7r(j) in G' . Then, the state 
Si(t) resulting from the initial values Xi,...,x n with the graph G is the same as the state S n ^(t) resulting from the 
initial values . . . , x v i n \ with the graph G' . 

Proposition 3.1. Suppose that the family {fi(xi), f2(xi, X2), fs{xi, x%, £3), . . .} is computable with infinite mem- 
ory. Then, each fi is invariant under permutations of its arguments. 

Proof. Let Wij be permutation that swaps i with j; with a slight abuse of notation, we also denote by 7Ty the mapping 
from X n to X n that swaps the ith and jth elements of a vector. We show that for all x £ X n , f n (x) = f n (^ij(x)). 

We run our distributed algorithm on the n-node complete graph. Consider two different initial configurations: 
(i) starting with the vector x; (ii) starting with the vector nij(x). Let the way each node enumerates his neighbors 
in case (i) be arbitrary; in case (ii), let the enumeration be such that the conditions in Lemma 13.11 are satisfied, 



which is easily accomplished. Since the limiting value of yi in one case is f(x) and in the other is f(iTij(x)), we 
obtain f(x) = f(itij(x)). Since the permutations -k^ generate the group of permutations, permutation invariancc 
follows. □ 

Let x G X n . We will denote by x 2 the concatenation of x with itself, and, generally, x k the concatenation of k 
copies of x. We now prove that self-concatenation does not affect the value of a computable family of functions. 

Proposition 3.2. Suppose that the family {fi(xi), f2( x i, ^2)1 fz{x\, X2, £3), ■ ■ ■} is computable with infinite mem- 
ory. Then, for every m > 2, every sequence x G X m , and every positive integer k, 

fm{x) = fk,n{x k ). 

Proof. Consider a ring of agents of size m, where the zth agent counterclockwise begins with the zth element of x; 
and consider a ring of size km where the agents i, i + m, i + 2m, . . . (counterclockwise) begin with the ith element of 
x. Suppose further that each node enumerates its two neighbors so that the neighbor on the left is labelled 1, while 
the neighbor on the right is labelled 2. See Figure [T] for an example with m = 3, k = 2 and Xi = i. 




Figure 1: Example of two situations that are indistinguishable by the nodes. 

Initially, the state of node i in the first ring is exactly the same as the state of the nodes j — i,i + m,i + 2m, . . . 
in the second ring. We show by induction that this property must hold at all times t. (To keep notation simple, we 
assume, without any real loss of generality, that i ^ 1 and i ^ m.) 

Indeed, suppose this property holds up to time t. At time t, node i in the first ring receives a message from node 
i — 1 and a message from node i + 1; and in the second ring, node j satisfying j (mod m) = i receives one message 
from j — 1 and j + 1. Since j — 1 (mod m) = i — 1 and j + 1 (mod to) = i + 1, the states of j — 1 and i — 1 are 
identical at time t, and similarly for and j + 1 and i + 1. Thus the messages received by i (in the first ring) and j 
(in the second ring) at time t are identical. Since i and j were in the same state at time t, they must be in the same 
state at time t + 1. This proves that they are in the same state forever. 

It follows that yi(t) = yj(t) for all t, whenever j (mod to) = i, and therefore f m (x) = fkm(x k ). □ 

We can now prove our main negative result, stating that if a family of functions is computable, then the value of 
the function only depends on the frequencies of the different possible initial values. We define D = {(pi, ■ ■ ■ ,Pk) 6 
[0, 1] K : y^fr—i Pk = 1}, which we call the proportion set. We say that a function h : D — > Y corresponds to a family 
(/„ : X" -> Y) if for every x G X n , 

f(xi, . . .,x n ) = h(p 1 (x 1 , . . .,x n ), . . . ,p K {xi, ■ ■ .,x„)) , 

where 

p k (xi, . . . ,x n ) = \{i I Xi = k, }\/n, 

so that pk(xi, . . . , x n ) is the frequency of occurrence of the initial value k. In this case, we say that the family (/„) 
is proportion-based. 

Theorem 3.1. Suppose that the family (/„) is computable with infinite memory. Then, this family is proportion- 
based. 

Proof. Let x and y be two sequences of n and to elements, respectively, such that Pk{xi, . . . , x n ) = Pk{yi, ■ • ■ , y m ) ='■ 
Pk for k = 1, . . . , K, that is, the number of occurrences of k in x and y are npk and mp^, respectively. Observe that 
for any k G X, the vectors x m and y n have the same number mn of elements, and both contain mnpk occurrences 
of k. The sequences y n and x m can thus be obtained from each other by a permutation, which by Proposition 13.11 
implies that f nm (x m ) = f nm (y n )- It then follows from Proposition [321 that fnm(x m ) = f n (x) and f mn (y n ) = f m (y), 
so that fn(x) = f m (y)- This proves that the value of f n (x) is determined by the vector (pi(x), . . . ,p n (x)). □ 



The following examples illustrate this result. 

(a) The parity function X)"=i x i (mod k) is not computable, for any k > 0. 

(b) In a binary setting (X = {0,1}), checking whether the number of nodes with Xi = 1 is at least 10 plus the 
number of nodes with Xi = is not computable. 

(c) Solitude verification, i.e. checking whether \i : {xi = 0}| = 1, is not computable. 

(d) An aggregate difference functions such as J2i<j \ x i ~ x j\ ^ s n °t computable, even calculated modulo k. 

4 Reduction of generic functions to the computation of averages 

In this section, we show that the computability question for large classes of functions reduces to the computability 
question for a particular averaging-like function. Namely, we will make use of the following theorem. 

Theorem 4.1. Let X = {0, . . . ,K} and define Y to be following set of single-point sets and intervals: 

Y = {{0}, (0, 1), {1}, (1, 2), . . . , {K - 1}, (K - 1, K), {K}} 

(or equivalently, an indexing of this finite collection of intervals). Let (f n ) be the following family of functions: f n 
maps (xi, X2, ■ ■ ■ , x n ) to the element of Y which contains the average J^. Xi/n. Then, the family (f n ) is computable. 

The proof of Theorem 14.11 is fairly involved and too long to be included in this extended abstract; however, we 
give an informal description of the algorithm for computing /„ in Section [5] In this section, we show that Theorem 
14.11 implies the computability of a large class of functions. We will say that a function h on the proportion set is 
computable if it it corresponds to a proportion-based computable family (/„). The level sets of h are defined as the 
sets L(y) = {p G D \ h(p) = y}, for y <E Y. 

Theorem 4.2 (Sufficient condition for computability). Let h be a function from the proportion set D to Y . 
Suppose that every level set L(y) can be written as a finite union, 

L(y)=\JC hk , 

k 

where each Ci y k can in turn be written as a finite intersection of linear inequalities of the form 

aipi + a 2 p2 H h olkPk < a, 

or 

ot\Pi + a 2 p 2 H h a K p K < a, 

with rational coefficients a, ot\, . . . , cxk- Then, h is computable. 

Proof. Consider one such linear inequality. Let P be the set of indices i for which > 0. Since all coefficients are 
rational, we can clear the denominators and rewrite the inequality as 

- < /?, (4.i) 

keP i£P c 

for nonnegative integers (3k and (3. Let %fe be the indicator function associated with initial value k, i.e., Xk(i) — 1 if 
Xi = k, and Xk(i) = otherwise, so that Pk = ^ X^i Xk(i)- Then, (|4.ip . becomes 

\ E ( E &x*(o + E M 1 x*(0) ) < z 3 + E 

i=l \keP k£P c / k£P c 

or 

-E*<<a 

where q t = J2keP PkXk{i) + EfeeP- /M 1 ~ Xk{i)) and q* = (3 + J2keP<= Pk- 

To determine if the latter inequality is satisfied, each node can compute qi and q* , and then apply a distributed 
algorithm that computes i X)™=i which is possible by virtue of Theorem 14.11 To check any finite collection of 
inequalities, the nodes can perform the computations for each inequality in parallel. 

To compute h, the nodes simply need to check which set L(y) the frequencies pi, ■ ■ ■ ,pk he in, and this can 
be done by checking the inequalities defining each L(y). All of these computations can be accomplished with finite 
automata: indeed, we do nothing more than run finitely many copies of the automata provided by Theorem 14. 1\ one 
for each inequality. □ 



Theorem 14.21 shows the computability of functions h whose level-sets can be defined by linear inequalities with 
rational coefficients. On the other hand, it is clear that not every function h can be computable. (This can be shown 
by a counting argument: there are uncountably many possible functions h, but for the special case of bounded degree 
graphs, only countably possible algorithms.) Still, the next lemma shows that the set of computable functions is rich 
enough, in the sense that such functions can approximate any measurable function. 

We will call a set of the form rHfc=i( a fci &fc)j with every a,k,bk rational, a rational open box, where Y\ stands for 
Cartesian product. A function that can be written as a finite sum a^l^, where the B>i are rational open boxes 
and the 1 s i are the associated indicator functions, will be referred to as a box function. Note that box functions are 
computable by Theorem l4.21 

Corollary 4.3. If every level set of a function h : D — > Y on the proportion set is Lebesgue measurable, then, 
for every e > 0, there exists a computable box function h e : D — > Y such that the set {h ^ h e } has measure at most e. 

Proof. The proof relics on the following elementary result from measure theory. Given a Lebesgue measurable set 
E 6 [0, 1] K and some e > 0, there exists a set E 1 which is a finite union of disjoint open boxes, and which satisfies 

fi((E - E') U (E' - E)) < e, 

where fi is the Lebesgue measure. By a routine argument, these boxes can be taken to be rational. By applying this 
fact to the level sets of the function h (assumed measurable) , the function h can be approximated by a box function 
h e . Since box functions are computable, the result follows. □ 

The following corollary states that quantizations of continuous functions arc approximablc. 

Corollary 4.4. If a function h : D — > [L, U] C 3? is continuous, then for every e > there exists a computable 
function h e : D—> [L, U] such that \\h — /i e ||oo < e 

Proof. Since D is compact, / is uniformly continuous. One can therefore partition D into a finite number of 
subsets Ai, A%, . . . , A q , that can be described by linear inequalities with rational coefficients, so that maxpg^ h{p) — 
min p6 ^. h(p) < e holds for all Aj. The function h e is then built by assigning to each Aj an appropriate value in 
{L,L + e,L + 2e,...,U}. □ 

Finally, we show that with infinite memory, it is possible to recover the exact frequencies pk- (Note that this 
is impossible with finite memory, because n is unbounded, and the number of bits needed to represent pk is also 
unbounded.) 

Theorem 4.5. The vector (pi, . . . ,px) is computable with infinite memory. 

Proof. We show that p\ is computable exactly, which is sufficient to prove the theorem. Consider the following 
algorithm, parametrized by a positive integer to. The initial set X m will be {0, 1, . . . , to} and the output set Y m will 
be as in TheoremgT] Y m = {{0}, (0, 1), {1}, (1, 2), {2}, (2, 3), . . . , {m - 1}, (m - 1, to), {to}}. If Xi = 1, then node 
sets its initial value Xi^ m to to; else, the node sets its initial value Xi tTn to 0. The algorithm computes the function 
family (/„) which maps to the element of Y m containing (1/n) x i.m, which is possible, by Theorem 14.11 
We will call this algorithm Q m . Let y m be its final output. 

The nodes run the algorithms Q m for every positive integer value of to, in an interleaved manner. Namely, at 
each time step, a node runs one step of a particular algorithm Q m , according to the following order: 

Qi, Qi,Q2, Qi,Q2,Q37 Qi,Q2,Q3,Q4, QxiQi-,--- 

At each time t, let m,i(t) be the smallest to (if it exists) such that the output y.i, m {t) of Q m at node i is a 
singleton (not an interval). We identify this singleton with the numerical value of its single element, and we set 
= yi,m,i(t){t)/ m i{t)- If wij(£) is undefined, then yi(t) is set to some default value. 

It follows from the definition of Q m and from Theorem 14 . 1 1 that there exists a time after which the outputs yi^ m 
of the algorithms Q\, . . . , Q n do not change, and are the same for every node, denoted y m . Moreover, at least one 
of these algorithms has an integer output y m . Indeed observe that Q n computes (1/n) X)"=i n lxi=i = S™=i lx$=i; 
which is clearly an integer. In particular, rrii(t) is eventually well-defined and bounded above by n. We conclude 
that there exists a time after which the output of our overall algorithm is fixed, shared by all nodes, and different 
from the default value. 

We now argue that this value is indeed p\. Let to* be the smallest m for which the eventual output of Q m is a 
single integer y m . Note that y m * is the exact average of the x.^ m * , i.e. y m » = i X)"=i Tn* l Xi =i = m*pi. For large t, 
we have y t (t) = yi, m *(t)/m* = pi. 



Finally, it remains to argue that the algorithm described here can be implemented with a sequence of automata. 
All the above algorithm does is run a copy of all the automata implementing Qi,Q2, • • ■ with time-dependent 
transitions. This can be accomplished with an automaton whose state space is the countable set N x U$^ =1 n^Li 2«j 
where Qi is the state space of Qi, and the set M of integers is used to keep track of time. □ 

To illustrate the results of this section, let us consider again some examples. 

(a) Majority testing between two options is equivalent to checking whether p\ < 1/2, with alphabet {0, 1}, and is 
therefore computable. 

(b) Majority testing when some nodes can "abstain" amounts to checking whether pi — P2 > 0, with alphabet 
{0, 1, abstain}. This function family is computable. 

(c) We can ask for the second most popular value out of four, for example. In this case, the sets Ai can be 
decomposed into constituent sets defined by inequalities such as p 2 < p$ < Pa < Pi , each of which obviously 
has rational coefficients. 

(d) For any subsets 1,1' of {I, . . . , K}, the indicator function of the set where JZieJ Pi — Yliei' Pi * s computable. 
This is equivalent to checking whether more nodes have a value in / than do in 

(c) The indicator functions of the sets defined by p\ < 1 /2 and p\ < ir/4 arc measurable, so they are approximablc. 

Wc arc unable to say whether they are computable, 
(f) The indicator function of the set defined by P1P2 < 1/8 is approximablc, but we are unable to say whether it 

is computable. 

5 A sketch of the proof of Theorem 14.11 

In this section, we sketch an algorithm for computing the average of integer initial values, but omit the proof of 
correctness. We start with an important subroutine that tracks the maximum (over all nodes) of time- varying inputs 
at each node. 

5.1 Distributed maximum tracking 

Suppose that each node i has a time- varying input Ui(t) stored in memory at time t, belonging to a finite set of 
numbers U. We assume that, for each i, the sequence Ui{t) must eventually stop changing, i.e., that there exists 
some T" such that 

Ui(t) = Ui(T'), foralU>T'. 

(However, the nodes need not be ever aware that Ui(t) has reached its final value.) Our goal is to develop a distributed 
algorithm whose output eventually settles on the value maxiUi(T'). More precisely, each node i is to maintain a 
number Mi(t) which must satisfy the following constraint: for every connected graph and any allowed sequences 
Ui(t), there exists some T" with 

Mi(t) = max Ui(t), for all t > T". 

i— l,...,n 

Moreover, node i must also maintain a pointer Pi(t) to a neighbor or to itself. We will use the notation Pf{t) = 
P P .( t }(t), Pf(t) ~ Pp2( t )(i), etc. We require the following additional property, for all t larger than T": for each node 

i there exists a node j and a power K such that for all k > K we have P^ = j; moreover, Mi(t) = Uj(t). In other 
words, by successively following the pointers Pi(t), one can arrive at a node with the maximum value. 

Theorem 5.1. An algorithm satisfying the above conditions exists and can be implemented at each node with a 
finite automaton whose state can be stored using at most C(log \U\ + d(i)) bits, for some absolute constant C . 

Wc briefly summarize the algorithm guaranteed by this theorem. Each node i initially sets M,(0) = u,(0), Pi(0) = 
i. Nodes exchange their values itj(f) and forward the largest value they have seen; every node sets its estimated 
maximum Mi(t) equal to that largest value, and sets its pointer Pi to the node that forwarded that value to i. When 
some Ui changes, the corresponding node sends out a reset message, which is then forwarded by all other nodes. The 
details of the algorithm and its analysis are somewhat involved because we need to make sure that the reset messages 
do not cycle forever. 



5.2 The averaging algorithm 

We continue with an intuitive description of the averaging algorithm. Imagine the initial integer values Xi as 
represented by x t pebbles. Our algorithm attempts to exchange pebbles between nodes with unequal number of 
pebbles so that the overall distribution becomes more even. Eventually, cither all nodes will have the same number 
of pebbles, or some will have a certain number and others just one more. We let Ui(t) be the current number of 
pebbles at node i\ in particular, it,(0) = X{. An important property of the algorithm is that the total number of 
pebbles is conserved. 

To match nodes with unequal number of pebbles we use the maximum tracking algorithm of Section 15.11 Recall 
that the algorithm provides nodes with pointers which attempt to track the location of the maximal values. When a 
node with itj pebbles comes to believe in this way that a node with at least Ui + 2 pebbles exists, it sends a request in 
the direction of the latter node to obtain one or more pebbles. This request follows a path to a node with a maximal 
number of pebbles, until the request either gets denied, or gets accepted by a node with at least Ui + 2 pebbles. 

More formally, the algorithm uses two types of messages: 

(a) (Request, r): This is a request for a transfer of value. Here, r is an integer that represents the current number 
of pebbles at the emitter. 

(b) (Accept, w): This corresponds to acceptance of a request, and subsequent transfer of w pebbles to the requesting 
node. A request with a value w = represents a request denial. 

As part of the algorithm, the nodes run the maximum tracking algorithm of Section 15. li as well as a minimum 
tracking counterpart. In particular, each node i has access to the variables Mj(i) and Pi(t) of the maximum tracking 
algorithm (recall that these are, respectively, the estimated maximum and a pointer to a neighbor). Furthermore, 
each node maintains three additional variables. 

(a) "mode"g {free, blocked}. Initially, the mode of every node is free. Nodes become blocked when they are 
handling requests. 

(b) "i?m 2 (i)", u Routi(t)" arc pointers to a neighbor of i or to itself. They represent, respectively, the node from 
which i has received a request, and the node to which i has transmitted a request. Initially, Ririi(0) = 
Routi(0) = 0. 

The algorithm is described in Figure [2j 

A key step in the proof of Theorem 14. II is the following proposition, whose proof is omitted. 

Proposition 5.1. There is a time t' such that Ui(t) = Ui(t'), for all i and t > t' . Moreover, 

I>i(*') = X)ui(O) = 

i i i 

\ u i{t') ~ u j{t')\ < 1> for alii, j. 

We now conclude our sketch of the proof of Theorem 14. II Let u* be the value that Ui(t) settles on. It follows from 
Proposition 15.11 that if the average x of the inputs Xi is integer, then Ui(t) = u* = x will eventually hold for every 
i. If x is not an integer, then some node will eventually have u* = \x\ and others u* = \x\. Using the maximum 
and minimum computation algorithm, nodes will eventually have a correct estimate of max^ u* and min^ u* , because 
each Ui(t) converges to u*. This allows the nodes to determine if the average is exactly x (integer average), or if it 
lies in (u*,u* + 1), or (u* — l,u*), which is the property asserted by Theorem 14. II 
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